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Abstract 

Suppose Alice has a coin with heads probabihty q and Bob has one with heads 
probabiUty p > q. Now each of them will toss their coin n times, and Alice will win iff 
she gets more heads than Bob does. Evidently the game favors Bob, but for the given 
p, q, what is the choice of n that maximizes Alice's chances of winning? We show that 
there is an essentially unique value N{q,p) of n that maximizes the probability f{n) 



< N{q,p) < 



max {l—p,q) 



that the weak coin will win, and it satisfies 

The analysis uses the multivariate form of Zeilberger's algorithm to find an indicator 
function Jn{Q,p) such that J > iff n < N{q,p) followed by a close study of this 
function, which is a linear combination of two Legendre polynomials. An integration- 
based algorithm is given for computing N{q,p). 
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1 The problem 



Suppose Alice has a coin with heads probabihty q and Bob has one with heads probabihty 
p. Suppose q < p. Now each of them will toss their coin n times, and Alice wins iff she gets 
more heads than Bob does (n.b.: in case of a tie, Bob wins). Evidently the game favors Bob, 
but for the given p, q, what is the choice of n that maximizes Alice's chances of winning? 

Interestingly, there is a nontrivial (i.e., in general > 1) unique value of n that maximizes 
her probability of winning. For example, in the case p = 0.2, q = 0.18, Figured] is a plot of 
Alice's win probability as a function of n. 
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Figure 1: Probability that Alice wins vs. n. 

In this example, if each player flips their coin 26 times, which is the best choice for her, 
Alice's chance of winning will be about 0.36, compared to a chance of 0.14 if each coin is 
tossed only once. 

In general, her chances of winning are 

r>0 ^ ^ s>r ^ ^ 
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This problem, which first appeared in [B], arose from a consideration of real- world events in 
the National Football League, where teams play a season of 16 games and do not play all 
other teams. If teams A and B have probabilities q and p > q, respectively, of winning any 
game and never play each other, one can wonder about the chance that A's season record 
will be strictly better than that of B. That is easy to answer, but then one is led to the 
question of whether the season length, 16, is favorable or not to such an outcome and what 
the optimal choice would be. A study of a related topic, where the central issue is the chance 
that the underdog beats a certain point-spread, has been published by T. Lengyel p]. 

We will also give, in section [10] an algorithm that uses repeated numerical integration to 
compute the optimum value N{q,p). Mathematica code for various computations, graphics, 
and algorithms (e.g., the generation of graphs of p„ or computation of N{q,p)) is available 
in the electronic supplement at [7]. 

Much of the work here has relied on computing power, both for numerical experiments 
and for proofs using symbolic computation. Some sophisticated algorithms in Maple and 
Mathematica (Zeilberger's MultiZeil, and cylindrical algebra reduction of polynomial sys- 
tems) played crucial roles; without them the discoveries and proofs would have been difficult, 
if not impossible, to find. 

1.1 Acknowledgments 

Professor Bruno Salvy, of INRIA, France, has kindly supplied to us some highly refined 
asymptotic results, which were of great assistance in this work. We thank Rob Knapp and 
Tamas Lengyel for some helpful discussions. 



2 Overview of methods and results 

It develops that there is, in this problem, a nice indicator function Jn{q,p), which is simply 
a linear combination of two consecutive Legendre polynomials, with the property that the 
sign of f{n + 1) — /(n) is the same as the sign of Jn{q,p)- We will find this indicator by 
using the multivariate form of Zeilberger's algorithm pj. We will then show that for small 
n, J is positive and for large enough n, J is negative, and that there is only a single integer 
value of n, or a consecutive pair {n,n + I), at which the sign of J changes. Thus / has a 
unique maximum, at n = N{q,p), say. Here is the precise result. 

Theorem 1. With f{n) defined by (J^ we have 

= + ^(1 + ^y)) <Pn{xy) - \<Pn,.{xyl (2) 
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where x = p/{l — p), y = q/{l — q), and 

and Pn{t) is the classical Legendre polynomial. Therefore the indicator function 

Jn{q,v) = [y + ^(1 + xy)j (t)n{xy) - ^0„+i(x?/) 

has the desired properties. 

We remark that, once found, the recurrence ([2]) can be proved directly, i.e., without 
Zeilberger's algorithm, with little difficulty. 

Next, in section Owe will prove uniqueness of and find upper and lower bounds for A^(g, p) 
by using various properties of the Legendre polynomials and by a close study of a function 
Pn{,q) which for each q G (0, n/ {2n + 1)), is the unique value of p for which Jn{q,p) = 0. The 
properties of the curves p = Pn{q) in the (p, q) plane play crucial roles here. First, concerning 
uniqueness, we have 

Theorem 2. (Unimodality) Given probabilities p > q with p + q ^ 1, there are either one, 
or two consecutive, values of n such that 

i- f{n) > f{n — 1), and 

2. f{n + l) < f{n), and 

3. at least one of the above two inequalities is strict. 

Definition. Given q < p, let N{q,p) be the value of n that maximizes f{n,p,q). When 
the value is not unique, define to be the smaller of the two possible values that yield the 
maximum. 

It follows from Theorem [2] and the definition of that N{q,p) is the smallest integer n 
such that Jn{q,p) < 0. 

The resulting upper and lower bounds for N{q,p) are given by 

Theorem 3. If N{q,p) is the choice of n that maximizes the probability that the player with 
the weaker coin will win (and with ties going to the lower value) we have: 

^- <Nil^P)' hutifp + q^l, then + \\ <N{q,p). 



2. N{q,p) < 



max (1— 
p~q 



Section [9] contains proofs of various properties of the graphs of p„, and they are used to 
obtain improvements to the upper and lower bounds on A^. 
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2.1 Definitions and notation 

The heads probabihties of the two coins are p and q, with p > q. We write x = p/{l — p), 
y = Q/i^-ci), z = xy, u = {l + z)/{l-z) = l + 2pq/{l - p - q), p = {I - p + q) / (I - p - q). 
Further, P„ is the nth Legendre polynomial and r„ = r„(u) = Pn{u) / Pn-i{u) . The P„'s 
satisfy the well known recurrence 

Pn+l{u) = —uPn{u) —Pn-l{u). (4) 

n + 1 n + \ 

If we divide through by Pniu), we obtain the ratio recurrence 

2n + l n 

rn+i{u) = —u - 5 

n+1 {n + l)rn{u) 

which will be of use in the sequel. 

The indicator function Jn{(l,p), which has the sign of /(n + 1) — f{n), is 

Jnil^P) = y<Pn{z) - i)n{z) = " _ _ ((1 ' P + g)fn(M) - {I - P - q) Pn+l{u)) , 

where 0„ is given by ([3]) ancQ 



^,^,^^,{z)-{l + z)Mz)) (6) 

T will denote the interior of the triangle in the {p,q) plane whose vertices are (0,0), (1,0), 
(|, 1). Tn will be the open interval {0,n/{2n + 1)). The line Lk{q) in the (p, g) plane is the 
line p = + q, and the line M„(g) is 

1 7i 

72 + 1 n + 1 

3 Finding the indicator function 

Our first task will be to find a recurrence for /(n). To do this we will use the multivariate 
form of Zeilberger's algorithm, MulZeil [T]. As usual the results that are returned by the 
algorithm can be easily verified by substitution. 



^For a quick proof of dH]), square both sides of the Pascal triangle recurrence. 



6 



Remarkably, this recurrence will show that f{n+l) — f{n) is simply expressible in terms of 
Legendre polynomials; this will enable us to identify the values of n for which f{n + l) > f{n) 
and those for which f{n + 1) < f{n). 

In view of eq. ([2]), Alice's probability of winning increases with n as long as 

y(t)n{xy) - tpnixy) = (^y + ^(1 + xy)^ <pn{xy) - ]^<pn+i{xy) (8) 

is positive, and decreases otherwise. We will show that for fixed x, y there is a unique value 
of n at which this function changes its sign. 



4 Finding the recurrence for f[n) 

In this section we will find the recurrence that is satisfied by /(n), the sum in eq. ([1]), using 
the multidimensional version of Zeilberger's algorithm. This will prove Theorem [H 



4.1 Finding the recurrence for the summand 

With 

x = p/{i~p), y = g/(i-g), ^?H = /H/((i-pr(i-gr), (9) 

the definition ([1]) of /(n) becomes 



r>0 s>r ^ ^ 



x'y" 



Let G{n,r,s) = {^){^)x'^y^, be the summand. We use Zeilberger's algorithm, and his pro- 
gram MulZeil returns a recurrence 

G{n+l,r,s) - {x + l){y + l)G{n,r,s) = (fsT^ - l)(ci(n, r, s)G(n, r, s)) 

+ iK,-l){c2{n,r,s)G{n,r,s)), (10) 

where Kr, Kg are forward shift operators in their subscripts, and the q are given by 

r{l + y) s{n + l) 

ci = ci(ra,r, s) = -; C2 = C2(n, r, s) = — — -. (11) 

r — n — 1 [s — n — i)[n — r + 1) 

This is the recurrence for the summand, and it can be quickly verified by dividing through 
by G{n, r, s), canceling all of the factorials, and noting that the resulting polynomial identity 
states that = 0. 
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4.2 Finding the recurrence for the sum 

To find the recurrence for the sum, we sum the recurrence ffTOj) over s > r, and then sum 
the result over r > 0. To do this we have first, for every function (p of compact support, 

Y.^K,. - l)0(r, s) = -Y, 0(0, s) + Y, 0, (12) 

r>0 s>r s>l r>l 

and 

Y.^K, - l)0(r, s) = -Y, ^{r, r + 1). (13) 

r>0 s>r r>0 

Consequently, if we sum the recurrence (ITU]) there results 

g{n + 1) — {x + l){y + l)g{n) = — ci(n, 0, s)G{n, 0, g) + ci(n, r, r)G{n, r, r) 

s>l r>l 

— C2{n, r, r + l)G(n, r, r + 1). 



r>0 



Next we insert the values, from f ITT]) . 

Cifra, 0, s) = 0; Ci(n,r, r) = -; C2(n, r, r + lj — 



r— n— l' ' ' (r — ri)(ri — r + 1) ' 

which gives 

+ G(n.r.r)-V + DC" + D 

r — ?7, — 1 

r>l r>0 

Now substitute the values G{n,r,r) = (")^(m)'', and G{n,r,r + 1) = {^){^^'^)x'^y^^^, and 
simplify the result, to obtain 

2 



g{n + l)-{x + l){y + l)g{n) = ^ '\ G{n, r,r)-J2 f \/ ^' ^ + l)' 



g(n + l)-ix + l)iy + l)gin) = ^ ^1 + v) M ^^^y 

r — n — l\r/ 
r>l ^ ^ 

y-v (r + l)(n + l) [''^\ f 



{r — n){n — r + 1) \r J \r + 1 

-(!/+i)e(,:i)(:)-'"«'^'+i: 

r>0 ^ / \ / 

y(j)nixy) - ^nixy), 



r r+1 

I 7' — n 1 1 — 'r -I- I I \ 'r / \ 7' + I / 

n + 1\ / n 
r J \ r 



r r+1 

X y 
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Next, replace g{n) by noting that = l/((l-p)(l-g)), 

to get the final result, namely that the recurrence for f{n) is 

f{n + 1) - f{n) A f \ I f \ 1 A f PI \ I f PI 



(14) 

It is easy to find the generating function of the sequence {/(^)}J^o from the recurrence 
It is 

y finr = , ' . [ 1 - ^-i^-P^^)i \ . (15) 

^ 2(l-t)l^ V(l-(l-p-g)t)2-4pgt; 

Corollary 1. (Symmetry) For allp, q we have f{n,p, q) = f{n, 1 — q, 1—p), and consequently 
for all < q < p < 1 we have 

N{q,p) = N{l-p,l~q). (16) 



Proof 1 . For a first proof, replace p by 1 — g and g by 1 — p in the generating function (fT5|) , 
and check that it remains unchanged. □ 

Proof 2. For a more earthy proof, Alice's winning of the {q,p) game means she had more 
heads. This is identical to Bob's having more tails. That occurs when Bob wins the tails 
game where he has a coin that comes up tails with probability 1—p and Alice has a coin 
that comes up tails with probability 1 — q. The probability of the latter is f{n, 1 — p,l — q) 
while the former happens with probability f{n,q,p). □ 



4.3 Remarks on the identity ([2]) 

The identity in equation ([2]) relates two different forms of the function f{n), namely the 
form ([1]), of its original definition, and the form on the right side of (|2]). Our first comment 
is that although the identity was discovered by Zeilberger's algorithm, it can be given a 
straightforward human proof, which we will now sketch. 

First, if we substitute ([T]) into the left side of ([2]) it takes the form 

EE{cr)("r)-(-^)(-K:)(:)}-^=^ 

when we express it solely in terms of the variables x and y. 

Now by inspection of the right side of ([T]) we see that only terms x"?/^ appear in which 
6 — a = or 1. Thus to prove the identity we might show that all monomials x"'y'^ on the left 
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side, i.e., in f ll7p above, vanish if 6 — a 7^ or 1, and that the remaining terms agree with 
those on the right. We omit the details. 

Our second comment is that from the identity ([2]) we can find a new formula for f{n) 
itself, the probability that Alice wins if n tosses are done. To do this, multiply both sides of 
([2]) by the denominator on the left, and sum over n. The left side telescopes and we find 

^ n — 1 

f{n) = -(1 _ (1 _ p - g)"P„(n) - (p - g) ^(1 - p - g)'=Pfc(n)), (18) 

fc=0 

in which, as usual, we have put u = 1 + 2pq/{l — p — q)- This formula is well adapted to 
computation of f{n). Let's define 

^ n—l 

Yn = + ^(1 - p - q)'Pk{u). 

l-p-q 

Then it's not hard to show that satisfies the recurrence 

(n-l)F„ = (3p + 3g-6pg-4 + n(3-2p-2g + 4pg))r„_i 

-(7p - 2p^ + 7q- lOpq - 2g^ - 5 + n{3 -Ap + p^ -Aq + 6pq + g^))F„_2 
+ (^_2)(l-p-g)=^y;_3, in>2), 

with Y^i = 0, Yq = 1/(1— p — g), Yi = 1 + 1/(1— p — g). Using this recurrence in f|T8|) is the 
only way we know to compute accurate values of the probability when n is large. 

5 Proof of the unimodality theorem 

In this section we will prove Theorem [2], the unimodality theorem for the optimum value of 
n. 

According to ([H]), we have /(n + 1) > f{n) precisely for those n such that y(f)n{xy) — 
i^nixy) > 0, i.e., as long as 

1 \ 1 

y + -{l+ xy) J (t)n{xy) - -0„+i(a;t/) > 0, 

or equivalently, as long as 

, / X < 1 + (a: + 2)y, 
(Pn[xy) 
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or 

p -, f l±m\ 

(1 - xy) ^^-4^ < 1 + (x + 2)y. (19) 



1-xy 

First suppose that xy < 1, i.e., that p + q < 1. We claim 
Theorem 4. Fix a number x > 1. Then the ratios 

Pn+i{x) 



Pn{x) 

strictly increase with n. 



(n = 0,l,2,...) 



While Theorem m can be proved by induction on n, we use a more general technique which 
shows that the result holds not only for the Legendre polynomials, but for any sequence of 
functions of n that are representable as g{t)^h(t)dt, with positive g, h. See Lemma [1] 
below. 

To prove this we start with a definition and lemma. 
Definition. A function g{t), defined on an interval a < t < 6, is admissible for that interval 
if g{t) > for all t G («,&), and for every finite sequence {xi} of real numbers, not all 0, it 
is true that J^i^igi'tY does not vanish identically on (a, 6). 

Lemma 1. Suppose g{t) is admissible for {a,b), and h{t) > for all t G {a,b). Let fin = 
J g{t)"'h(t)dt, and suppose that all /ij > 0. Then is a strictly increasing function of 
i = 0,1,2,.... ^' 

Proof. Let H be the infinite Hankel matrix {/ij+j}jj>o. Consider the principal submatrix 
formed by the first n rows and columns of H. If xq, Xi, . . . , Xn-i are arbitrary real numbers, 
not all zero, then the quadratic form 

n 

n-l n-1 „ft „ft /n-1 \ 

Qn=J2 ^^Hijxj = J2 ^^^j / g{ty^'Ht)dt = / J2^'9{ty Kt)dt, 

i,j=0 i,j=0 "^^ \i=0 / 

is clearly positive. Hence if is a positive definite matrix, whence its 2 x 2 principal minors 
yW2j/^2i+2 - are all positive, i.e., 

^<^;^<^;... (20) 
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Next replace h(t) by g{t)h{t). Then the sequence {/ii}i>o is replaced by /ij+i {i > 0) and the 
Hankel matrix H is replaced by one whose entry is fii^j^i. We apply the conclusion 
(|20|) to this new situation and we discover that 

^<^;^<^;... (21) 
If we combine fl20p and 0211] we obtain the result stated in Lemma [TJ □ 



To prove Theorem H] we have the integral representation 

1 1"^ , 

P^{x) = - {x + Vx^ -1 COS t^dt (22) 
Jo 

for the Legendre polynomials. We can take g{t) = x + \/x^ — \ cos t and h(i) = 1 in Lemma 
[1] and the conclusion of Theorem H] follows. □ 



We remark that this is the reversal of a celebrated inequality of Turan which holds inside 
the interval of orthogonality. Now that the ratio of the Legendre polynomials on the left 
side of ( !T9l) is known to be a strictly increasing function of n, we observe that when = 
the left side has the value 1 + xy, and when n — )■ oo, the well known asymptotic behavior of 
Pn{x) for fixed x > 1 and large n shows that the left side approaches (1 + ^/xyY, which is 
larger than 1 + (x + 2)y. Hence there is a unique n for which the left side of f|T9|) is < the 
right side, but at n + 1 the inequality is reversed. 

The case where xy > 1 \s reduced to the case xy < 1, which we have just handled, by 
equation ( fT6i) . If xy = 1, i.e., if p + g = 1, we discuss the situation in the next section. The 
proof of Theorem |2] is now complete. □ 



6 The interesting special case p + q = 1 

Consider the special case in which p + q = 1. Then xy = p{l — p)/((l — p)p) = 1, and we 
can carry out the calculations analytically in full. Indeed, we now have 0n(l) = (^^) and 
■?/'„(l) = n(^") /(ra + 1), from which we get 

q , ^ , ^ q f2n\ n f2n\ 
1 — q 1 — q\n J n + l\n J 

q n \f2n\ ^23) 



1 — g n + \ J \n 
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This last vanishes iffg/(l — g) = ?2/(n + l), which is g = 
equals that of g — n/{2n + 1). This proves the following. 



n/{2n + 1). The sign of J„ then 



Lemma 2. 



J n. 



n 



n 



0. 



2n + l 2n + l 

Now the unimodality theorem, Theorem [2], gives an explicit formula for N on this line. 
Theorem 5. (The Diagonal Formula) For < q < 1/2, we have 



Niq, 1 - q) 



2q 



Proof. By the uniqueness theorem, and the tie-breaking aspect of the definition of N{q,p), 
N{q,p) is always the least integer n such that J„ < 0. Because of the agreement of the sign 
of Jn and that of q — n/{2n + l), the result follows. Note that when q = n/{2n + l), the J = 
condition means that there is a tie between the two values \q/ (1 — 2g)] and \q/ (1 — 2g)] + 1 
for the optimal choice. □ 



7 A general lower bound 



Theorem 6. Let N{q,p) denote the optimum choice ofn, i.e., the one that maximizes Alice's 
chance of winning. Then we have 



N{q,p) > 



l2{p-q) 2J 



(24) 



First we need the following 



Lemma 3. In the trapezoid Tn, defined by the lines q = 0, p = q, p+q = 1, and the inequality 
p < l/(2n + l) + g, the indicator function Jn is positive. That is, if p < Ln{q) and {q,p) G T, 
then Jn{q,p) > 0. 

Proof of the Lemma. We use induction on n. Figure 2 shows the trapezoid. Since r„ C r„_i, 
the induction is vahd. The case n = 1 follows easily from Ji(g,p)(l — p — g)/(2g(l — p)) = 
1 — 2 + 3g), Li = 1/3 + g, and q < 1/2. Suppose that J„ < 0. Because p < L„_i(g), the 
induction hypothesis applies, giving J„_i > 0. Therefore r„+i > p > Vn- The ratio recurrence 
tells us that {n + l)r„+i — {2n + l)u + n/rn = 0. Therefore {n + l)p — {2n + l)u + n/p < 0, 
or 1 — {2n + l)p + 2nq + g < 0. Thus l/{2n + 1) + g < p, contradicting p < Ln{q). □ 
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0.5 



Figure 2: The trapezoid r, 



n 



Proof of Theorem [6l Again, suppose first that xy < 1, i.e., that p + q < 1. The theorem 
says that if p < Ln{q), then n cannot be N{q,p). Therefore for n < l/(2{p — q)) — 1/2, n is 
not N{q,p) and N{q,p) > l/(2(p - q)) - 1/2. More precisely, > [l/(2(p - q)) - 1/2] = 
[l/(2(p - q)) + 1/2J. But on the p + g = 1 hue. 



So the latter works as a lower bound in both cases. Finally, if p + g > 1, then the symmetry 
formula of Corollary [1] yields N{q,p) = N{1 — p,l — q), a transformation that leaves the 
bound invariant. □ 

8 The upper bound 

In this section we study in detail the curves J = and use the results to obtain a simple 
upper bound on N{q,p) which is roughly twice the lower bound of Theorem Ei 



N{q,p) = \q/{1 



2q)] = Ll/(2(p-g))-l/2j. 
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Theorem 7. (Upper bound on N) 

Niq,p) < 

p-q 

8.1 Curves on which J„ vanishes 

The key idea underlying our analysis of N{q,p) is an understanding of the vanishing sets of 
Jn- Figure 3 shows these curves, together with some of the lines M„ and 

The upper black curve is Ji = 0, and so on down. The blue lines are M„ and the red 
ones, L„. So we know that below the uppermost black curve Ji > and so iV > 2. In fact, 
the = 1 region is just the region above the first black curve. But now we need to prove 
various properties evident from the diagram. 

8.2 The function Pn{q) 

Our first task will be to define, and to verify the correctness of the definition, of a function 
Pniq) which for each q G (0,n/(2n + 1)], is the unique value of p for which Jn{q,p) = 0. We 
will therefore start by proving existence and uniqueness of such a p. 

Lemma 4. For q G T„, JniQ, 1 — q) < 0. 

Proof. As in eq. (!23|) we have 

\l — q n + 1 J \ n J 

which is negative iff g < n/{2n + 1). □ 
Lemma 5. For q G T„, Jn{<ii <i) > 0. 

Proof. The condition that Jn{q,q) > is the same as r^+i < p = 1/(1 — 2g), so we must 
prove that r„+i < 1/(1 — 2g) when q G T„. We will prove more, namely that r„ < 1/(1 — 2q) 
whenever < g < 1/2. 

Let r be the fixed point that is > 1 of the Legendre polynomial recurrence, i.e. the root 
of the quadratic equation 

(2n + l)u n 

r = — 

n + 1 [n + l)r 

that is > 1. This root is 

J(2n + lYu^ -Anin + l) + u(2n + 1) 

W„ = 

2(n + l) 
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It is easy to check that 1 < Wn < 1/(1 — 2q), and tedious, but routine, to check that 
Wn < Wn+1 for < g < 1/2. 

Now we can prove by induction that r„ < Wn whenever < g < 1/2, which suffices. 
(Note the change from g G T„ to g G (0, 1/2); this is essential to allow the induction to carry 
through.) The base case can be taken to be 



iyi-ri = ^l2g + 2J-9(l-g)g+ ^ ^ 



4(l-2g)2 4 l-2g 

whose positivity is follows from the fact that the expression is positively infinite at g = 1/2 
and only when g = 0. 

For the inductive step, take the recurrence 

{2n + l)u n 



n + 1 (n + l)r^ 
and assume inductively that < Wn- Then 

{2n + l)u n 



^1 < 



n+l {n + l)Wn 



but this last equals Wn, a fixed point of the recurrence. Therefore r„+i < Wn, and the proof 
of Lemma [5] is complete because Wn < Wn+i for < g < 1/2. □ 

The algebraic part of the proof actually yields the more general result that r„ < 1/(1 — 
p-q). 

Theorem 8. (Existence theorem) Given n>l and q < n/{2n + 1), there is a value p with 
q < p < 1 — q such that Jn{(l,p) vanishes. 

Proof. When g < n/{2{n+ 1)) this follows from Lemmas H] and [5], which tell us that Jn{<iiP) 
is a strictly decreasing function of p, going from a positive value to a negative one. At the 
endpoint the fact that J„(g, 1 — n/{2{2n + 1)) = gives the existence of the desired p. □ 



Definition: Given n > 1 and q G Tn, we write p„ for the largest value of p between g and 
1 — g such that Jn{,(l,p) = 0. 

We now turn to the important proof that, in all cases, there is only one value p so that 
Jn{q,p) = 0. The key is the following lemma about dJ/dp. 

Lemma 6. We have 

d 

-^Jn{Q,p) < for g < p < 1 — g. 
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Proof. The derivative inequality, after multiplication by 



-{I - pYp{'^ - qy 



l-n 



q \{l-p){l-q)^ 
becomes 

Pn{u) {n (2p2 - p(2g + 1) + g - 1) _2pg+p + g-l) + (n + - g + 1)P„+i(m) < 0, 

and so holds precisely when r^+i < V, where 

n (2p^ — p{2q + 1) + g — 1) — 2pq + p + q — 1 



V 



{n + l){p + q-l] 



(The inequality was reversed because —{n + 1)(1 — p — g) < 0.) 

Now r„_|_i < V can be proved by an easy induction, since the domain of truth does not 
depend on n. For the base case examine V — ri, which works out to be the positive quantity 

2np{l — p) 



{n + 1)(1 - p - q) 
The induction step uses the usual recurrence. Suppose r„ < V; then 

(2n + l)u n {2n + l)u n 



' n+l 



< 



n + l (n + l)r„ n+l {n + l)V' 



But this last is less than V because the difference V — ((2n + l)u)/{n + 1) — n/{{n + 1)^) 
works out to 

2n{p — g) — 2g + 1 
n (— 2p2 + 2pg + p — q + 1) + 2pq — p — q + 1 

in which all the grouped terms are positive. □ 

Theorem 9. (Uniqueness Theorem) Given n>l andO < q < n/{2n + l). Ifq<p<Pn{q), 
then Jn{q,p) > 0. Thus there is only one vanishing value for Jn- 

Proof. This follows from Lemma|6]which implies that Jn{q,p) is a strictly decreasing function 
of g; thus it cannot return to after it has once taken that value. □ 
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8.3 Properties of J 

Now we have the functions p„ defined on T„, with the property that Jn{(l,Pn{(l)) = 0. We 
will call the graph of p„,(g) a J-nulldine, and denote it simply by p„, or often just p. We 
need several properties of p„. Note that most of the properties below have one-line proofs 
thanks to the efficient definition of Pn and the uniqueness result (Theorem [H]). 

Lemma 7. The function pn is continuously differentiable (C^) on Tn- 

Proof. This is a consequence of the fact that J„ (q', p) is differentiable (it is a rational function) 
and Lemma O which states that > 0. These facts show that the hypotheses of the 

implicit function theorem are satisfied. 

The preceding result is about the triangle T. But it also works on the p + q = 1 line, for 
on that line 



Jn{q,p) = Jn(g, 



q n \ f2n\ fq n \ f2n 



q n + 1 J \n J \p n + 1 J \n 



and the partial derivative -^Jniq,p) is just —q{^n) /p^, which is nonzero. By symmetry the 
same proof works on the opposite side of the p + q = 1 line. □ 

Lemma 8. (Derivative formula) For any point {q,p) G T and on the graph of pn, we have 

i( ^ _p{^-p){f^P-M + P-^) 
Pn^^> - q[i^q)^n{q-p) + q) ' 

Proof. By the implicit function theorem, 



Vn(g,p) 
a^,Jn{q,p)' 



Pn = -^TT— T- (25) 



Taking the derivatives, using the recurrence to eliminate Pn+2, and using the Jn{q,p) = 
relation to eliminate Pn+i in favor of P„ leads immediately to the formula. □ 

Lemma 9. (Linear vanishing condition) p'^iq) = iff Pn H^s on the line Mn- 

Proof: Immediate from the numerator of the derivative formula (Lemma |8]). □ 

Lemma 10. Given n>l and q < p, if J nil-, P) = then the partial derivative {d/dq)Jn{q,p) 
is not zero. 
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Proof. We work first in T. The partial derivative of Jn{(l,p) can be taken and simplified 
using the standard recurrence for P„+2, and also the relationship derived from J„ = 0, to 
replace P„+i by pP„. This leads to 

-Jn{q,P)- 



dq ' (1 - - g + 1) 

The denominator is nonzero and > 1 in T so Pn{u) > 1; therefore the partial derivative 
vanishes at a point on a J-nullcline iff n{p — q)+p~l = OiSp = + = Mn{q)- 
So we must show that this value of p cannot lead to a point at which J„ vanishes. Define 
h{q) = Jn{q, M„(g)). This evaluates to 



-2nq+n—q 
n(g-l)2 



{{n + q)Pn{u) + {n{2q - 1) + g)P„+iH) 



2n(l - g)2 

When g = 0, we have u = 1 and so Pniu) = 1 and h{0) = 0. We also have (limits are from 
the left) 

(71 Tl -\- 1. 
n TT' n TT 
2n + 1 2n + 1^ 

by continuity of J„. But this last vanishes, by the diagonal formula of Theorem |5l Thus 



lim h{q) = 0. 

q^n/(2n+l) 

Hence h{q) is a differentiable function of q which vanishes at both ends of the interval 
(0, n/ {2n + 1)). Figure 4 shows the graphs of h ioi n = 1, 2, 3, 4, 5. It remains to show that 
h cannot vanish for any q ^T^. 

Suppose h{q) vanishes at some g G T„. Then, because of the vanishing at the endpoints, 
there must be a point qi such that h'{qi) = and h{qi) > 0. More precisely, if there is a 
true crossing at q then gi would be given by the Mean Value Theorem applied to either [0, g] 
or [g, n/ {2n + 1)]; if there is a tangency to the axis at g (or even if the function is identically 
0), then gi = g. 

So at the point (gi, M„(gi)) (which we denote by just (g, M„(g)) in the expressions below) 
we would have two relations for r„_|_i: The h{qi) = J„(gi, M„(gi)) > condition means that 

Tn+i < P={l-P + qi)l (1 - p - gi), 

where in the last fraction p is to be replaced by M„(gi); and taking the g-derivative of 
h{q) = Jn{q, Mn{q)) and recursively removing P„+2 leads to the following equation: 

+ 1) [ ^ ~ g ~ 2ng \" CiPnju) + C2Pn+iiu) ^ 

^V^(g-1)2 J 2n(g- l)3(ng + l)(n(2g- 1) + g) ~ ' 
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i 2 3 4 0.5 

3 5 7 9 



Figure 4: The graphs of J„(g, Mn{q)) for n = 1,2, 3, 4, 5. 

where 

Ci = 2n^q + [Aq^ - 4g^ + 5g + l) + n (2g^ + + 2g + l) + g(g + 1), 
C2 = (n + 1)(1 + g + 2ng)(— n + g + 2ng). 
Clearing the nonzero factors tells us that r„+i = —Cxjc^, and therefore 

which reduces to 

2//(l - g)g 

<0, 



(n + l)(2ng + g + 1) 

a clear contradiction, which establishes the theorem for the triangle T . But the same proof 
works on the p + g = 1 line, for on that line 



q n \ ( 2n\ ( q n \ [2n 



1 — q n + 1 J \n J \p n + 1 J \n 



and the partial derivative {d/dq)Jn{q,p) is just which is nonzero. By symmetry the 

same proof works on the opposite side of the p + q — 1 line. □ 

Lemma 11. For q e we have p'^iq) 7^ 0. 
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Proof: Because of fl25l) . the claim follows from Lemma [TU| which shows that the numerator 
does not vanish on the graph of p„. □ 

Lemma 12. 



Proof: The definition of Pn can be carried over by symmetry to the other side of the p + q = 1 
line to yield a differentiable function. The implicit function theorem applies on the line itself 
as noted in the proof. But then, by symmetry of all the probabilities, and therefore of 
the vanishing of J„, Pn is symmetric across the line. Thus differentiability implies that the 
derivative must be 1 to avoid a cusp. □ 

Theorem 10. (Upper bound on Pn) For every q G T„, we have Pn{q) < Mn{q), where Mn{q) 
is the line ^ above. 

Proof: Because Pn and M„ agree on the line p + q = 1 (Lemma [T2l). and because 



(Lemma [T^ while the slope of M„ is n/{n + 1), the fact that Pn is (Lemma [7j) means 
that Pn is under M„ when q is just left of the p + q = 1 line. Lemma [TT] tells us that p'^ is 
never 0, and therefore, by Lemma [9l p„ can never cross the line M„. □ 

Lemma 14. N{q,pn{q)) = n. 

Proof. When {q,p) G T, this follows from Theorem [2] because Jn{q,Pn{q)) = and so 
Jn-i{,q,Pn{,q)) > 0. Therefore n is the least m such that Jm{q,Pn{q)) = 0. If Pn{q) = ^ — Q 
then it must be that q = n/ {2n + 1) and, by the proof of Theorem [21 n = g/(l — 2q) is the 
unique integer such that Jn{q, 1 — = 0, establishing the result. □ 

Lemma 15. For every q we have pi{q) > P2{q) > Psi^) > ■ ■ ■■ 

Proof. The graphs pn can never cross because if Pn{q) = Pm{q), N{q,p) would be simultane- 
ously n and m by Lemma [HI and the right end of p„ is above the right end of Pn+i (Lemma 
[12]). So continuity of the graphs yields the result. □ 




Proof: Follows from the diagonal equation of Lemma [2l □ 
Lemma 13. 
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Lemma 16. The graphs of Pn{q) determine the value of N{q,p) exactly as follows: N{q,p) 
is the least n such that p > Pn{q)- 

Proof. We know this is correct when we are on the graph pn (Lemma [H]). But if {q,p) is 
between Pn~i and pn then J„_i > and J„ < 0, by Theorem [8], and this means N{q,p) = n. 
□ 

Figure 5 shows how the graphs of pn divide triangle T into regions that define the optimal 
A^- values, and how Pn is bounded by the two lines M„ and 

Corollary 2. N{q,p) is a nonincreasing function of p. Thus ifthep-coin becomes stronger, 
then the optimal choice of game length for the holder of the q-coin cannot get larger. 

We can now prove Theorem [71 the upper bound on N{q,p). 

Proof. Assume first that (g, p) G T, in which case the claimed bound is just [(1 — p)/{p — q)] ■ 
Let n be the smallest value so that p lies at or above Mn{q). Then n = \{1 — p)/{p — q)] and 
for this to be a bound we need, by Lemma [T6l that Pn{q) < Mn{q). But this is exactly what 
Theorem [TO] tells us. For the special case on the diagonal line: N{q, 1 — q) = \q/{l — 2g)], 
by the diagonal formula. But this is identical to [(1 ~ p)/{p — q)] , which therefore works in 
both cases. The case in which p + g > 1 is handled by symmetry (see Corollary [T]) , with q 
taking the role of 1 — p. □ 

The two proved bounds in Theorem [3] are equal 47% of the time because they must agree if 
it happens that M„(g) < p < L„_i(g); these conditions define a collection of triangles whose 
area, in proportion to T, is easily computed to be 7r^/4 — 2. In all such cases, then, N{q,p) 
equals \{1 — p)/{p — q)]. 

9 Deeper analysis of the nullclines 

In section [S] we proved many properties of p„ that were evident from the graphs. We continue 
that here, gaining information that leads to improved bounds on N and to an efficient 
algorithm for computing (section [TOj) . We first observe that when n is small, pn is given 
by simple formulas. Such formulas are useful as a check on computations. 

Lemma 17. 



P2{q) 



Pi{q) 



1- q 

2- 3g' 
1-q 



(2 - 4g - -4g + 6g2). 



3 - 12g + 10g2 
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7 



Figure 5: The graphs of Pn separate T into regions where iV = l,2,3,.... The two red hnes 
M3 and L3 form bounds on ps, and this relationship underhes the lower and upper bounds 
on N{q,p). 
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Proof. Just solve the polynomial equation Jn{<liP) = 0. There are more complicated radical 
expressions for p2, Ps, and p^. The last is ostensibly a quintic, but the polynomial in the 
equation is divisible by 1 — p yielding a quartic equation. □ 

Because J„ is infinitely differentiable in T, the fact that the hypothesis of the implicit function 
theorem is met (Lemmas [6] and [7]) means that pn is infinitely differentiable. The second 
derivative is easily computed by implicitly differentiating the derivative formula (Lemma |8]). 

Lemma 18. We have 
where Z is given by 

2n\p - qf + 2n^{2p^ - p^{7q + 1) + pqi7q + 3) - 2q'^{q + 1)) + n{2p^ - ^^(llg + 2) 
+pqillq + 10) - g(2g2 + 7g + 1)) - (p - l)q{3p - 3q - 1). 

But we can prove the weaker and still very useful assertion that the slope never rises 
beyond its value at the right end. 

Lemma 19. For q E Tn we have Pn{q) > 0. 

Proof. Because the slope is 1 at the right end (Lemma [T3l) . but never vanishes (Lemma fTTl) . 
it is always positive. □ 



We next move to a proof of the (computationally evident) fact that the J-nuUclines 
are convex. Figure 6 shows the second derivatives of p„ for n < 15. They are evidently 
unbounded at g = 0, converging to at the right, and always positive. We will now prove 
positivity (i.e., convexity of the J-nuUcline curves), which will be important as a source of 
new approximations to N{q,p). 

Theorem 11. (Convexity) p'^iq) > inT^- 

Proof. By Lemma [3l any point {q,Pn{(l)) lies above Ln{q), so the line connecting {q,Pn{(l)) 
to {n/{2n + 1), 1 — n/{2n + 1)) has slope less than 1. By the Mean Value Theorem, there 
is some qi > q for which p^(g) < 1. But the slope is 1 at n/{2n + 1), so there is a point at 
which the second derivatgive is positive. Since is continuous, the proof will be complete 
once we show that the second derivative cannot vanish. The second derivative is given by 
the formula of Lemma [181 We can eliminate nonzero factors, thus reducing its vanishing to 
the following equation in p. 
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0.1 0.2 0.33 0.4 0.5 

Figure 6: The graphs of p'^io) for < 15. 



[2n^ + An^ + 2n) + {-6n^q - 2n^{7q + 1) - n{llq + 2) - 3q) 
+p {Qn^q^ + 2n^q{7q + 3) + ng(llg + 10) + + 4g) - ng [2q^ + 7g + l) 
-3g^ - g - 2n^g^ - An^q^{q + 1) = 

Fixing n and g, the vanishing condition is a cubic in p. The cubic always has a real root 
and checking the critical points assuming < g < 1/2 and n > 1 shows that they never 
straddle 0, which means that the other two roots are never real. Call the unique real root 
p~{q)- The theorem will be proved once we show that JniQiPnio)) 7^ in T„, so that the 
inflection point is not on the graph of p„. We can get a closed form for p~{q) by solving the 
cubic, but we do not need the explicit representation as the needed algebra can be worked 
out implicitly from the cubic relation. Yet it is instructive to look at p^(q'). Figure 7 shows 
its graph for n < 5, together with the graphs of in pink, and also the base- 10 log of the 
difference for n = 1, 2, 3, 10, 20, 100. The inflection curve is barely below the J-nuUcline and 
the two curves are visually indistinguishable. 

We can use implicit differentiation on the defining cubic to get a formula for -^p~{q)- 
Since Pnio) is given explicitly by radicals we know its derivative exists. The implicit deriva- 
tive formula p^(g) = —dq/dp then gives the following representation for the derivative 
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Figure 7: Left: The graphs of p„ (pink) and p„ (g) (black). Right: The difference in the 
graphs viewed through a logarithmic lens. 

iPniQ)- 

{6n^ + 8n + 3) p2 _ 2p {6n^q + n{8q + 3) + 3g + 2) + 6n{n + 1)^^ + (8n + 6)g + 1 

6n2(p - g)2 + 2n (3p2 - 2p{Aq + 1) + g(4g + 3)) + q{-6p + 3g + 4) ' ^ 

Now we follow the proof idea of Lemma [TOl We need to show that, given n, the point 
il^Pnil)) cannot lie on the J„-nullcline. Define the function h{q) = Jni.l^Vn^l))^ which we 
claim does not vanish in T„. Figure 8 shows the first few graphs of h. 

We observe that h vanishes at both ends. At the left this is because J„(0,p) is always 
(the Legendre terms become just 1). At the right we find that the defining cubic, when q is 
set to n/(2n + l), has the factor 2np — n+p — 1, which means that p = 1 — 72/(2^ + 1) is a root. 
The cubic has exactly one real root in the domain of interest, so 1 — n/{2n+ 1) = ( 2J+1 )) 
and we know (Lemma [2]) that Jn{{n/(2n + 1), 1 — n/{2n + 1)) = 0. 

Now suppose the graph is at some qi E Tn. Then looking left and right of qi and using 
the Mean Value Theorem, we get a value, call it q, such that h{q) < and h'{q) = 0. Recall 
that h{q) < means that r„+i = Pn+i/ Pn ^ P = i^—p+l)/ {^~P~q), where p denotes p^(g). 
When we form h'{q) leaving p~ {q) undefined and then substitute the derivative formula fl26l) 
for the derivatives of p~{q) that appear, we obtain the familiar form ciP„ + C2P„+i = 0. This 
becomes — Ci/c2 = Tn+i, so we have the relation — Ci/c2 > p. If we work out ci,C2 in terms 
of n, j9, q we get a rational function of these three variables which is too large to reproduce 
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Figure 8: The graphs of Jn{<l,Pn n < 5. 

here, but which can be found at |7]. A call to Mathematica's Reduce function shows, in two 
seconds, that the five conditions: 

1. -Ci/C2 > p, 

2. the vanishing of the second derivative formula at p, 

3. 0<q<p<l-q, 

4. n > 1, and 

5. g<n/(2(n + l)) 

are contradictory. For such polynomial systems Reduce uses a cylindrical algebra decompo- 
sition this example requires showing that there is no solution in each of 1062 cylindrical 
cells. Working this out by hand might be extremely difficult, if not impossible. □ 

Corollary 3. For q G T„, < 1. 

Proof. The second derivative is positive in T„, so the first derivative is always less than its 
value of 1 at n/{2n + 1). □ 

While the original probability formulation makes no sense when g = (there is no optimal 
choice of N when the underdog loses each play), the limit \img^QPn{q), evident from Figure 
3, is quite simple. 
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Lemma 20. limg^o+ P„(g) = ^ 

Proof. Because p'nil) > (Lemma [T9|) . the values decrease as g — )■ 0; the values are 

bounded and so the claimed limit exists. Thus we will use p(0) to denote \img^Q+ Pn{q) ■ 
Further, < p{0) < 1, for by Lemma [31 p„(g) > l/(2n + 1) + g so p(0) > l/(2n + 1); and 
p(0) < 1— n/(2?7, + l) because the derivative is positive and j9„(n/(2?T, + 1)) < 1 (Lemma [T2|) . 
Now, the derivative formula tells us that 

{{p - l)p{np - nq + p - 1)) _ ^ ^ 
{{q - l)q{n{q - p) + q)) 
where p denotes p„. Multiplying both sides by the denominator turns this into 

{p — l)p{np — nq + p — 1) — {q — l)q{n{q — p) + q)p\q) = 0. 
But the boundedness of p'^ means that the limit of qp'{q) is 0, giving 

lim p{p — l){np + p — 1) = 0, 

g-i>0+ 

or (j9(0) — l)p(0)(p(0)(?7. + l) — 1) = 0. Only the last factor can vanish, giving p(0) = l/{n + l). 
□ 



Corollary 4. (Slope convergence) The limit limg^o+ p'^{q) exists. 

Proof. The slopes are bounded below (Lemma and monotonic by the convexity of p„. □ 
Corollary 5. (Slope-limit formula) \imq^Q+ p'^{q) = n/{2{n + 1)). 

Proof. Let S denote the limit. Because the derivative formula p' = ((1 — p)p{np ~ nq + p — 
1))/ ((1 — q)q{nq — np + q)) holds for the slopes, we want the limit of this expression as g — i- 0. 
Now, as g — )■ 0, p — 7- l/(n + 1). So we can look at the numerator and denominator separately 
and see that we have a 0/0 form, and we can use I'Hopital's rule to get the limit, where we 
use p(g) for p. Forming the I'Hopital quotient, using limp(g) = l/(n + 1), limp'(g) = S, and 
then letting g be yields (n — (n — l)^) /(n + 1). Setting this to equal to 5* and solving gives 
the formula. □ 

Corollary [5] allows us to think of function on all of M as follows. Let Sn be the 

limit of the slopes that the corollary provides. Define to agree with p„ on [0,n/(2?7, + 1)) 
and to be the linear function through (0,p„(0)) of slope Sn on (— oo,0], and the similar 
tangent-line extension on the right. It is easy to see using the Mean Value Theorem and 
the hmit definition of Sn that the limit of the slopes of the secants connecting (0,p„(0)) to 
{q,Pn{q)) is Sn, giving the continuous differentiability of p*. 
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Corollary 6. (Slope hound) p^il) > 2(n+i) 

Proof. By the previous corollary, because the slopes are monotonically increasing, by con- 
vexity. □ 

Corollary 7. (Linear lower bound) In Tn, Pn > Kn{q) — + 2(n+i) ^' ^^'^ 



Proof. The linear function Kn agrees with p„ at g = by Lemma [201 If Pn clipped below 
Kn, the Mean Value Theorem would provide a point contradicting Corollary [61 Inverting 
the bound on p gives a bound on A^. □ 

The upper bound on N given by Theorem [71 and the piecewise lower bound obtained by 
combining Theorem [6l and Corollary [71 are useful computationally. For when the two bounds 
agree we know immediately that N{q,p) = \l/{2{p — q)) + 1/2J. And in cases where N 
equals the piecewise lower bound, that can be verified by a single J-evaluation: just check 
that Jn{q,p) is negative when n is the lower bound. The subset of T for which the two 
bounds agree is a union of triangles - the green region in Figure 9 - and the total area of 
these triangles can be determined by integrating Mathematica's Boole function to get an 
expression for each level and then summing the results symbolically. The total area of the 
region in which equals the lower bound can be approximated by experimentation using 
thousands of points. The results are summarized in the next theorem. 

Theorem 12. 1. For{q,p) chosen uniformly from T, the probability that the upper bound 
of Theorem^ and the combined lower bounds of Theorem\^ and Corollary\^ agree is 



where ip is the dig amma function T'/T, and'y is Euler's constant. This is roughly 0.60. 
2. For {q,p) as above, the probability that 



N{q,p)> [(1 



p)/{p 



g/2)l 



(1 - i)ij{3 - + (1 + i)^{3 + + — + 27 



115 




(27) 



is approximately 0.87. 
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Corollary 8. (Improved bounds on N) In T , N {q,p) < N{q,p) < N^{q,p), where N and 
N'^ are, respectively, the positive-radical solutions, n, to the quadratic equations: 

2n^{j) — qY + 2n [p^ — 3pg — p + q^ + 2g) (p — 9) — (1 — — 3p + 3g) = 0, 

and the equation DF= n/{2{n + 1)), where DF is the derivative formula of Lemma\^ 

Proof. For the lower bound, the equation is equivalent to setting the second derivative 
formula (Lemma [T5]) to and clearing nonzero terms. First define 

ker = 2n^{p - qf + 2n {p^ - p'^{Aq + 1) + pg(4g + 3) - + 2)) - {p - l)g(3p - 3g - 1), 

the result of clearing nonzero terms from the second derivative formula. Then the second 
derivative formula vanishes iff ker does. Further, dnker > in T when we add the condition 
Ln{q) < P- Because N{q,p) is not less than the least n such that p'^iq) = 0, the result 
follows. The upper bound is obtained the same way, using the fact that the slope is not less 
than n/{2{n + 1)), which becomes a quadratic relation. □ 

Expanding the rational expression for in a series in powers oi p — q shows that it equals 
l/(2(p — g)) — 3/2 + l/(4p(l — p)) + 0{p — q), which relates it nicely to the simpler bound 
of Theorem O Define H{q,p) = [l/(2(p — g)) — 3/2 + l/(4p(l — p))] ; while H is not a 
lower bound on N, it is a useful approximation when p is close to q and appears to be 
asymptotically perfect when the domain is rescaled (see subsection 19. ip . 

One can view the quadratic equations that give A^^ as cubic equations in p, in which 
case solving gives radical expressions for p^, which bracket the curve p„. Such bounds are 
useful when generating images such as Figure 13 and also theoretically, as they are used in 
the proof of Theorem [T31 

We can measure how good the improved bounds are by assuming that the point {q,p) 
is uniformly distributed in T. Then one can ask: (1) How often does N = (2) How 

often does N = N7 The answers are: "remarkably often." Figure 10 shows points in T 
colored green if both bounds agree, red if the lower bound is correct, and yellow otherwise. 
The upper bound is sometimes correct, but not often. Of course, when the bounds agree 
we know immediately, and when the lower bound is correct, then that can be proved by 
verifying that JN-{q,p) < (more on computing J in section 10 below). The blue curves 
are the graphs of p^; they bracket pi, defined by the yellow-red boundary. In Figure 10 the 
green area where the two bounds agree is 75% of the triangle and the green-plus-red area 
where N equals the lower bound is 97% of the area. 
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Figure 9: The green region is where the two bounds on agree; the red region is where 
agrees with the lower bound. Thus in the combined red and green regions N{q,p) is 

expressible exactly as max 



"l-p" 
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0. 0.1 0.2 0.3 0.4 0.5 



Figure 10: The green region is where = A^^; red is where N = N < , and yellow is 
the rest. The blue curves are the graphs of pj~ and pf. 

9.1 A harmonic rescaling 

The views of Figures 9 and 10 do not show clearly what happens near the p = q line. We 
can take a microscope to that area by harmonically rescaling the domain. We do this by 
first rotating T clockwise 45° and then stretching out the vertical scale: precisely, after the 
rotation we change each y-coordinate to —{l/{2^/2y)) — 1/2. This is essentially just a change 
of coordinates from {q,p) to {p + q,^/{p — q))] Figures 11 and 12 show the view through this 
microscope. The two approximations and H exactly equal N a large percentage of the 
time. For the region defined by < 5000 we found that = A^~ in > 99% of the region 
while A^ and H agree 96% of the time. Thus we can conjecture that the probability of either 
equation holding is asymptotically 1. 
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Figure 11: A rotated and vertically stretched view of the {q,p) domain. The colors have the 
same meanings as in Figure 10. We see here that the region (yellow) where is not equal 
to is very small. 

9.2 The situation when p and q are close 

The various diagrams suggest we look more closely at the situation near the p = q line. The 
structure can be discerned by looking at the regions in which the excess of N{q,p) over the 
simplest lower bound (Theorem[3]) is constant. From this we will obtain the result that for any 
point (g, q) there is an integer 6 such that, close to (g, q), we have N{q,p) — [ 2{p-g) — 

Definition. For (g,p) G T let A{q,p) = N{q,p) — [ 2{p-q) l\- amount by which the 
optimal n exceeds the lower bound derived from L„ < Pn- 

Suppose {q,p) is such that Pn+j{q) < p < Ln{q) and also < p. The first inequality 

tells us that n+l<N<n + j. The last inequality means that the lower bound derived 
from L„ is n + 1. So A < j — 1. Now we can profitably examine the regions determined by 
the intersection points of the L-lines with the nullclines Pn+j- Getting the intersection points 
is easy numerically using the function p„, computed by a differential equation (section [TOj) : 
they are plotted as large black joined points in Figure 13. Note that each L„ is tangent to 
Pn at its right edge, strikes p2n at its left, and, because the slope of each nuUchne is under 1 
(Lemma [T9|) . hits each in-between p-graph in exactly one point. 

Figure 13 tells the story. In the left image the colored regions correspond to constant 
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Figure 12: A rotated and vertically stretched view of the {q,p) domain in the region where 
< 50. Here red is where N{q,p) = H{q,p), yellow where they are not equal. The 
proportion of the space in which H is correct is 76%, but it appears to converge to 100% as 
N ^ oo. 

values of A. The uppermost black arc connects points common to: Li and p2', L2 and p^] and 
in general L„ and Pn+i- The second-highest black arc connects points common to: L2 and 
P4; L3 and J95; L4 and pq; in general Ln+i and Pn+s- These arcs, which we cannot compute 
without computing p„, divide T into regions (right image of Figure 13) in each of which A 
takes on only two values. 

Definition. For n,j > 1, let be the g- value of the intersection point of Ln+j+i and 
Pn+2j-i and let q~,j be the same with with p replaced by the lower bound Pn^2j-i derived 
from the convexity theorem (Corollary [8]). 

Thus the arcs in Figure 13 are obtained by fixing j and joining the points determined 
by {qn,j} as n = 1, 2, 3, . . .. Because we used a lower bound on p, we know that q~j < qnj- 
Further, we can express q~j quite simply by solving the equation ker = (see Corollary [8]) 
after substituting p = L„+j_i(g), 




2j + 2n-l 
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Figure 13: The left image shows regions where A is constant (between the red J-nullchnes 
and black lines L„). The right image shows how the dividing arcs separate T into regions 
where the discrepancy from the lower bound takes on one of two values. 
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The limit of this expression is easy to find; we use it to define Qooj = |(1 ~ a/j/ (j + !))• A 
derivative shows easily that g„ j is increasing in n, and therefore qnj approaches Qooj from 
the left. The points (goo,i, ?oo,j) are the limits of the arcs in Figure 13; we now turn to a 
proof of this useful fact. Because q~j < qnj and approaches the limit from the left, we need 
only show that g„j < qooj- This inequality is not true in general (though it does appear to 
be true when j = 1) but we can show that, for each fixed j, it is true for sufficiently large n, 
and that suffices for the limit. 

Theorem 13. For fixed j and sufficiently large n, qn j < qoo,j- 

Proof. To say that qnj is to the left of qooj is the same as saying that Ln+j-i{qoo,j) lies above 
Pn+2j-i{qoo,j)- This in turn is the same as the assertion that Jn+2j-i{,qoo,ji Ln+j^i{qoo,j)) < 0. 
And this is equivalent to r„+2i(w) > p, where goo,j and Ln+j-i{qoo,j) are used for q and p in 
p and u. 

Now define g{n) = (^")4-" and let 



v=0 \j=l ^ , 



2\-u- 



1 



2j-l 
I - Ij 4 

This is the asymptotic series of Laplace-Heine [Sj Thm. 8.21.3] for the Legendre polynomials 
P„(m) for u> 1. Here z = u + yjv? — 1. Thus 

P„(«) = P;jtz) + 0(n— i^"). 

Therefore, forming the ratio, we have 



To show that rn+2j{u) — p > we must show that 



' n+2j 



2n + Ajn~l + Af-2^j{j + l) \ ^ 2(j + l)(n + j - 1) 



2(1 +J + Vj(JTI)(2n + 2j - 1)) ; V7(JTI)(2n + 2j - 1) - (j + 1) 



We can now apply the asymptotic series using three terms (m = 3), and take three terms 
of the Taylor series of the result, centered at oo. Fewer than three terms are not sufficient. 
When we do this using Mathematica^s Series command and some further simplification we 
get the following expression, whose positivity concludes the proof- 



r^^,,iu) -P=lij + 1? (l - Vj/U + 1)) + Oin-'). 



□ . 
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Given q; let jo{q) = [(1 "~ 2g)^/(4g(l — g))], the largest j such that gooj < 9, let no(j) be 
the least integer n guaranteed by the theorem, i.e., < qooj for n > uq. It appears that 
uq < 3j^, but we have no proved bound. 

Corollary 9. If q < p < i^noOofa)) then N{q,p) < + U + Fi^^l ■ 

Proof. The definition of uq means that as one starts at the point {q, Lno{jo{q))) and moves 
down, one strikes, in alternating order, the graphs of Pn and the lines L„. This means that 
for these points A equals its value just under (g, or 1 greater. Since the value of 

A just below the line L„g is at most [(1 — 2g)^/(4(l — q)q)'\ — 1 (see comments at start of 
this section), the result is proved. □ 

Corollary 10. Given qo, with < qo < \, we have lim(p^g)_j.(g(,^gQ)(g — p)N{q,p) = | 
Proof. Immediate from the previous corollary, which implies that 

- < N{q,p) < + ^- if + 2. □ 



{2{p-q)) 2- ^"'"^-2(p-g) 4(1 - g)g 

If in Theorem [13] one replaces the sharp q^oj by simply l/(2j + 1) one obtains a much 
weaker theorem, which can be proved in the same way; i.e., g^j < l/(2j + 1) for sufficiently 
large n. However, in this case it appears that the result is true for all n, as is evident from 
Figure 13. A proof of this would yield a new upper bound on N{q,p), weaker than the one 
in Corollary [HI but with the advantage of being true for all {q,p)- 

Open problems. Prove that r„+2j('u) < p, where l/(2j + 1) and L„+j_i(l/(2j + 1)) are, 
respectively, used for q and p in p and u. Find estimates for no{j). Show that no(l) = 1. 



10 An algorithm for computing N[q^p) 

A straightforward algorithm to compute N{q,p) first uses symmetry to restrict to T and 
then finds the smallest integer n such that p > Pn{(l)] that value of n is N{q,p) by Lemma 
[T6l One can start with the simple or improved bounds and then use either bisection or the 
secant method, repeatedly checking whether Jniq,p) is positive or negative. This method 
works fine when N is of modest size, but when N is large the Legendre polynomials cannot 
be explicitly computed. A solution is to use the integral formula given in (!22|) . which is a 
fine substitute for P„. That formula means that we can determine the sign of Jn for each 
trial by using numerical integration on 

iu + Vu"^ — ! cos t j dt — {1 — p — q) / iu + Vu'^ — l cos t ) dt. 
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Of course, high-precision must be used as appropriate. One needs enough accuracy to 
account for the full precision of n which will be used as a trial in the root-finding process. 
Further, the expression yju^ — 1 can be numerically unstable for extreme values of p and q 
and one should use the equivalent form 2y/ (1 — p)p{l — q)q/ [1 — p — q). 

But one needs only the sign of the integral above. In Mathematica this means that when 
computing the integral numerically one needs a large working precision, but the accuracy 
goal can be quite small. The method is robust and takes only a few seconds to compute 
Ar(10-io°, 2 • 10-i°0), which is 

72768 90317 94675 98852 95987 53552 38752 84521 10838 88022 00705 28794 63897 19626 49789 
77512 24788 32188 39061 36928. 



Q 


p 


N{q,p) 


10-5 


2 • 10-5 


72768 


10-10 


2 • 10-10 


7276890317 


10-15 


2 • 10-15 


727689031794675 


10-20 


2 ■ 10-20 


72768903179467598852 


10-25 


2 • 10-25 


7276890317946759885295987 


10-30 


2 • 10-30 


727689031794675988529598753552 



Table 1. The integration algorithm allows one to get giant values of N{q,p). 

When one wants not just the sign of J^, but a numerical approximation to the full J„- 
nullcline - the graph of p„ - one can use a numerical differential equation approach. Because 
of the derivative formula and the known values at 0, we can set up the initial- value problem 
as p{0) = l/(n + 1) and 

^ p{l - p){np - nq + p - 1) 
^^^^ qil - q){n{q - p) + q) 

if g > and n/(2(n -|- 1)) if g = 0. This approach is a quick way to generate graphs of Pn, 
such as those shown in various figures in this paper. It can also be used in an algorithm for 
computing N{q,p) where it can sometimes be faster than the use of numerical integration 
because the solution is needed only on the interval [0,q], while the integrals are computed 
from to TT. 
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11 Some open questions 

1. Improve the bounds on N{q,p). 

2. Find a more efficient algoritfim for computing N{q,p) when {q,p) is near tfie origin. 

3. Generalize, in a natural way, these results to the case of three players. 

4. Prove the second derivative conjectures, which we obtained heuristically by manipu- 
lating Taylor polynomials: 

(a) 

+ 5n + 2 



lim p''(q) — ; r- 

q^o+^''^^' 6(n + l) 

and 

(b) 



^ 4(2n + l) 
hm p^{q) = 



H^)- n{2n^ + n-l) 

5. (Asymptotic closed form conjectures) 

(a) If q and p are chosen uniformly from the harmonically rescaled domain then the 
probability that N{q,p) = N~{q,p) approaches 1 as — > oo. 

(b) Same as above, but with the conclusion that N = |'l/(2(p — q)) — 3/2 + l/(4p(l — p))~\ 
with asymptotic probability 1. 
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